β,β-CAROTENE 15,15'-DIOXYGENASES, NUCLEIC
ACID SEQUENCES CODING THEREFOR AND THEIR USE

FIELD OF THE INVENTION 

The present invention concerns the cloning of β,β-carotene 15,15'-dioxygenase
(EC 1.13.11.21), the enzyme responsible for the cleavage of β-carotene leading to vitamin
A. The term vitamin A as defined in the present invention comprises a class of compounds
including retinal, retinol, 3-dehydroretinol, retinoic acid, the isomers of these
compounds, as well as retinyl esters. Proteins having β,β-carotene 15,15'-dioxygenase
activity and nucleic acid sequences coding therefor can be used in different fields
including but not limited to diagnostics, the technical production of vitamin A, the
generation of transgenic plants in order to produce vitamin A in fruits and vegetables, or
gene therapy.

SUMMARY OF THE INVENTION 

In one embodiment of the present invention, a polypeptide having β,β-carotene
15,15'-dioxygenase activity is provided. This polypeptide includes SEQ ID NO: 1 or a
polypeptide having β,β-carotene 15,15'-dioxygenase activity and being at least 60%
homologous to SEQ ID NO: 1 as determined by the Wisconsin Sequence Analysis Package
GCG, Version 9.1 (1997).

The present invention also includes a nucleic acid sequence encoding the 
polypeptide defined above, such as for example, SEQ ID NO: 2 or a fragment thereof. 
Another embodiment of the invention is a primer for amplifying a gene coding for a
protein having β,β-carotene 15,15'-dioxygenase activity which includes a nucleic acid
sequence as defined above.

A probe is also provided for detecting a gene coding for a protein having
β,β-carotene 15,15'-dioxygenase activity. This probe includes a nucleic acid sequence as
defined above.

A test kit is also provided for amplifying and/or detecting a gene or a fragment
thereof coding for β,β-carotene 15,15'-dioxygenase. The test kit includes at least one
primer as defined above. The test kit may also include at least one probe as defined above
alone, or in combination with at least one primer according to the present invention.

Another embodiment of the invention is an antibody which specifically reacts with
a polypeptide as defined above.

An immunoassay is also provided for the detection and/or quantification of
β,β-carotene 15,15'-dioxygenase. This immunoassay includes at least one antibody as set
forth above.

A process is also provided for the production of vitamin A. This process includes
enzymatically cleaving β-carotene by a polypeptide as described above.

Another embodiment is a method for introducing a β,β-carotene 15,15'-dioxygenase
cDNA into a host cell. This method includes inserting a cDNA coding for a
polypeptide as described above into a vector suitable for the host cell and introducing the
vector into the host cell.



A host cell is also provided. This host cell may be obtained by the method set forth
above. The host cell includes a β,β-carotene 15,15'-dioxygenase cDNA obtained from
another species.

Another embodiment of the invention is a polynucleotide which encodes
β,β-carotene 15,15'-dioxygenase and includes the sequence of SEQ ID NO: 2.

A vector is also provided which includes the sequence of SEQ ID NO: 2. A host 
cell is also provided which has been transformed with this vector. 

The present invention also includes a polypeptide having β,β-carotene
15,15'-dioxygenase activity, which polypeptide contains the amino acid sequence of
SEQ ID NO: 1 or 4.

A primer set is also provided for amplifying a polynucleotide encoding
β,β-carotene 15,15'-dioxygenase. This primer set includes SEQ ID NO: 8 as a 5' primer and
SEQ ID NO: 9 as a 3' primer. Another primer set for amplifying a polynucleotide
encoding β,β-carotene 15,15'-dioxygenase is also provided which includes a polyT/Not
reverse primer and SEQ ID NO: 10 as a forward primer.

The present invention also includes a kit for amplifying and/or detecting a
polynucleotide or fragment thereof encoding β,β-carotene 15,15'-dioxygenase. This kit
includes at least one primer selected from SEQ ID NOs: 8, 9, and 10.

BRIEF DESCRIPTION OF THE DRAWINGS

Figure 1 shows the result from the last step of purification of β,β-carotene
15,15'-dioxygenase from the small intestine of chicken. The SDS-PAGE pattern and
β,β-carotene 15,15'-dioxygenase activity of individual fractions from the gel permeation
chromatography run are shown. On the gel, the protein A marked by an arrow correlated
best with the β,β-carotene 15,15'-dioxygenase activity. It was therefore chosen for further
amino acid sequence analysis. The abbreviations have the following meaning: Std.:
molecular weight standard; conc.: concentrate loaded onto the gel permeation
chromatography column.

Figure 2 shows schematically the transactivation assay in eukaryotic cells. cDNAs
are transfected and expressed in MCF-7 cells. When incubated with β-carotene, a positive
pool shows cleavage activity. The cleavage product retinal is further oxidized to retinoic
acid (RA) which binds to the endogenous receptor. The receptor/ligand complex binds to
the response element on the reporter plasmid and leads to an enhanced transcription of the
luciferase gene. The luminescence signals are detected in a luciferase assay with a sensitive
CCD camera.

Figure 3 shows the cDNA sequence (SEQ ID NO: 2) for β,β-carotene
15,15'-dioxygenase which has a length of 3090 base pairs excluding the poly A tail. 132 base
pairs are 5' nontranslating sequence, the coding sequence has 1578 base pairs and the 3'
nontranslating sequence 1380 base pairs, respectively. A putative poly A signal is found at
position 3073.

Figure 4 shows the derived amino acid sequence (SEQ ID NO: 1) of β,β-carotene
15,15'-dioxygenase derived from chicken having 526 residues. The
amino acid sequence is given in the one letter code.
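
The lengths quoted for Figures 3 and 4 can be cross-checked with a few lines of arithmetic. The following is an illustrative sketch only: the variable names are not from the source, and it assumes that the 1578 base pair coding sequence is counted without a stop codon, because 1578 / 3 equals the 526 residues stated for SEQ ID NO: 1.

```python
# Lengths quoted in the descriptions of Fig. 3 and Fig. 4.
# Variable names are illustrative and do not appear in the source.
five_prime_bp = 132        # 5' nontranslating sequence
coding_bp = 1578           # coding sequence
three_prime_bp = 1380      # 3' nontranslating sequence
reported_cdna_bp = 3090    # total length excluding the poly A tail
reported_residues = 526    # residues of the derived protein

total_bp = five_prime_bp + coding_bp + three_prime_bp
codons, remainder = divmod(coding_bp, 3)

print("segments sum to reported cDNA length:", total_bp == reported_cdna_bp)
print("coding sequence is a whole number of codons:", remainder == 0)
# Assumption: the stop codon is not included in the 1578 bp figure.
print("codon count equals reported residues:", codons == reported_residues)
```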

Figure 5 shows a comparison of the β,β-carotene 15,15'-dioxygenase amino acid
sequence (SEQ ID NO: 4) with a protein having the designation RPE65 (SEQ ID NO: 5)
which was found by a sequence comparison in EMBL Genbank as the protein having the
highest homology to the β,β-carotene 15,15'-dioxygenase of the present invention.
Figure 6 shows two fractions of β,β-carotene 15,15'-dioxygenase eluted from a
Co2+-chelate column. In lanes 1 and 2 two different fractions were loaded and lane 3 is a
low range molecular weight marker.

Figure 7 shows an HPLC analysis of an activity test of β,β-carotene
15,15'-dioxygenase which was cloned and expressed in E. coli.

Figure 8 is a chromatogram demonstrating that the peak from Fig. 7 representing 
the only product of the enzymatic cleaving is retinal. 

DETAILED DESCRIPTION OF THE INVENTION 

Vitamin A is essential for man and animal and is largely formed in most organisms
from its precursor carotenoids which, by themselves, can only be formed in plants, in
photosynthetically active microorganisms and some other microorganisms. Man and most
animals (in particular herbivores and omnivores) are able to convert such carotenoids, also
called provitamins A, enzymatically into vitamin A. The most important enzyme for this
process is the β,β-carotene 15,15'-dioxygenase (EC 1.13.11.21). The enzyme is located in
the cytosol and forms retinal from β-carotene, as the principal substrate, in the presence of
oxygen according to scheme 1:



Scheme 1. 




The enzyme β,β-carotene 15,15'-dioxygenase is characterized by generating 2 mol
retinal from 1 mol of β-carotene by central cleavage. But the enzyme is also able to convert
a wide range of carotenoids into vitamin A-active compounds, as shown in scheme II:
[Scheme II: structural formulae not reproduced. Legend as far as legible: R' = OH, O or
any; R'' = CHO, COOH, COOR''', further -CHO, -COOH and -COOR''' groups with illegible
carbon-chain prefixes, or any; whereas R''' = alkyl or any.]

The highest known enzymatic activity is found in the intestine of herbivores, especially
in the duodenum. β,β-Carotene 15,15'-dioxygenase is also detectable in other tissues such
as liver, lung, kidney and brain. Starting in 1955, many attempts have been undertaken to
purify and characterize the enzyme by biochemical methods (Goodman (1965 and 1966),
Fidge (1969), Lakshmanan (1972), Sharma (1977) and Devery & Milborrow (British
Journal of Nutrition (1994) 72, p. 397-414)). However, none of these attempts has been
successful. Specific activities of 600 pmol retinal formed/mg protein per hour have not
been surpassed.



In the course of the present invention it was possible to purify the chicken enzyme
to such a degree that a partial amino acid sequence could be obtained. The enzyme was
enriched 226-fold, yielding a specific activity of 2500 pmol/h/mg. On a polyacrylamide gel
of fractions from the final gel filtration run, 15 bands were visible after Coomassie blue
staining. Two bands correlated with the enzymatic activity profile of the β,β-carotene
15,15'-dioxygenase. Edman sequencing was performed on the first protein and mass
spectrometry on the second. Tryptic in-gel digestion and subsequent microbore RP-HPLC
peptide mapping in combination with MALDI-TOF MS and automated Edman degradation
of this latter protein revealed 2 peptides of 11 and 18 amino acids. From this sequence
information degenerate PCR primers were designed and synthesized.
Using a PCR protocol, a 51 bp (base pair) fragment lying within the longer
peptide was amplified. From this sequence a homologous primer was synthesized and used
in a second RT-PCR (reverse transcriptase PCR) to amplify a 597 bp fragment.

This cDNA fragment was radioactively labeled and used for the screening of two
positive pools from a chicken expression library in order to isolate the full length cDNA
coding for β,β-carotene 15,15'-dioxygenase.

The positive pools were obtained from a cDNA library from chicken duodenum
which was screened for β,β-carotene 15,15'-dioxygenase activity in a cellular
transactivation assay. By this strategy several positive cDNA pools were identified. By
combining the two strategies the gene coding for β,β-carotene 15,15'-dioxygenase could be
successfully cloned.

It is an object of the present invention to provide a protein having the vitamin A
producing activity of β,β-carotene 15,15'-dioxygenase comprising an amino acid sequence
which is identical or homologous to SEQ ID NO: 1 (shown in Fig. 4) whereby the degree
of homology to SEQ ID NO: 1 is at least 60%.

With the sequence of β,β-carotene 15,15'-dioxygenase isolated from chicken,
corresponding proteins from different animals like swine, cow, goat, dog, rabbit, poultry,
fish and humans can easily be obtained. Since the chicken sequence is known, suitable
regions of the nucleic acid sequence can be selected as primers for a polymerase chain
reaction with a suitable nucleic acid which allows an easy and rapid amplification of the
gene coding for the protein.

The present invention therefore includes not only proteins having an amino acid
sequence identical to the sequence given in SEQ ID NO: 1 but also such proteins which
have an amino acid sequence homologous to SEQ ID NO: 1. The degree of homology
is, however, at least 60%, preferably 70%, more preferably 80%, such as for example, at
least 90%. Homology as defined in the present invention means that when the amino acid
sequences of two proteins are aligned at least the given percentage is identical. The
alignment of the amino acids is performed with the help of a suitable computer program
which is commercially available, in particular, the Wisconsin Sequence Analysis Package
GCG (Genetics Computer Group, University Research Park, Madison), Version 9.1, 1997.
The remainder of the amino acids may be different. A homology of 90% for example
means that 90% of the amino acids of the protein are identical compared with the amino
acid sequence given in SEQ ID NO: 1 whereas 10% of the amino acids may be different.
The proteins of the present invention have, however, the biological activity of β,β-carotene
15,15'-dioxygenase which is explained above in more detail. In the present invention, this
protein may be derived from other sources, for example, from other mammals, such as
from humans.
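
The homology definition above (percentage of identical residues in an alignment) can be illustrated with a minimal sketch. It is not the GCG package's algorithm: the function name `percent_identity` is hypothetical, the input is assumed to be an already computed pairwise alignment with `-` as the gap symbol, and the choice of the reference length as denominator is an assumption made for illustration. The toy sequences are invented and are not taken from SEQ ID NO: 1.

```python
def percent_identity(aligned_ref: str, aligned_query: str, gap: str = "-") -> float:
    """Percentage of reference residues paired with an identical query residue.

    Both strings must come from the same pairwise alignment (equal length).
    Assumption: the denominator is the number of residues in the reference.
    """
    if len(aligned_ref) != len(aligned_query):
        raise ValueError("aligned sequences must have equal length")
    ref_residues = sum(1 for r in aligned_ref if r != gap)
    if ref_residues == 0:
        raise ValueError("reference contains no residues")
    identical = sum(
        1 for r, q in zip(aligned_ref, aligned_query) if r != gap and r == q
    )
    return 100.0 * identical / ref_residues


# Toy alignment with invented sequences (one mismatch, one gap in the query).
ref = "MKTAYIAKQR"
qry = "MKTAYLAK-R"
identity = percent_identity(ref, qry)
print(f"{identity:.1f}% identical; meets the 60% threshold: {identity >= 60.0}")
```

With a real alignment, the same comparison against the 60%, 70%, 80% or 90% levels named above would apply; only the alignment step itself is left to dedicated software.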



As used herein, the terms "protein" and "polypeptide" are used interchangeably
throughout. The terms "nucleic acid" and "polynucleotide" are likewise used
interchangeably.



The term "nucleic acid" is intended to include, without limitation, DNA, RNA,
cDNA, and mRNA. As used herein, the DNA may be genomic, synthetic, or
semi-synthetic. Moreover, the nucleic acids of the present invention include
single-stranded and double-stranded molecules.



As used herein "derived from" means that the protein, polypeptide, and/or
polynucleotide exists naturally in an organism, such as for example, a chicken. However,
the polypeptides and polynucleotides of the present invention may be produced/obtained
from any source. Thus, the present invention includes recombinant, synthetic and
semi-synthetic proteins, polypeptides, and polynucleotides.
The compositions of the present invention are said to be "isolated," such as for
example "isolated polypeptide," "isolated polynucleotide," etc. As used herein, the term
"isolated" is intended to mean that the polypeptide or polynucleotide is purified or at least
partially purified as described, for example, in Fig. 1.



Another aspect of the present invention concerns nucleic acid sequences coding for
a protein having the biological activity of β,β-carotene 15,15'-dioxygenase. A nucleic acid
sequence coding for the enzyme derived from chicken is shown in SEQ ID NO: 2 (see
Figure 3). The nucleic acid sequences of the present invention code for a protein of the
present invention or a part thereof. Shorter nucleotide sequences suitable for PCR have a
length of at least 20 bases, preferably at least 25 bases and most preferred at least 30 bases.



The nucleic acid sequences of the present invention can be used as primers for the
specific amplification of a gene or part thereof coding for β,β-carotene 15,15'-dioxygenase.
Primers can also be used for the specific amplification of 5' nontranslating or 3'
nontranslating sequences of the cDNA described above. The nucleic acid sequences of
β,β-carotene 15,15'-dioxygenase cDNA can be used as a probe for the detection of the coding
as well as for the noncoding regions or parts thereof. The nucleic acid sequences of the
present invention can be used as antisense RNA probes for in situ hybridization.

It is especially preferred to use primers and probes having a part of the sequence
given in SEQ ID NO: 2 as primers and/or probes in test kits which can be used for the
amplification and/or detection of genes/mRNAs coding for β,β-carotene
15,15'-dioxygenase by the polymerase chain reaction (PCR). The selection of suitable parts of the
nucleic acid sequence can be performed by the person skilled in the art without difficulties.
A nucleic acid sequence used as a primer or probe is usually selected from a region which
is highly conserved within the protein. Conserved means that the nucleic acid sequences of
such regions of proteins obtained from different species are very similar.



On the other hand the preferred nucleic acid sequence should not be present in other
nucleic acid sequences which do not code for β,β-carotene 15,15'-dioxygenase, because
this might lead to false positive results. By aligning several sequences derived from
different species such regions can easily be determined. Although the nucleic acid can be a
ribonucleic acid it is more preferred to have deoxyribonucleic acid sequences.
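
The idea of locating conserved regions by aligning sequences from several species can be sketched as follows. This is a simplified illustration, not a primer design tool: the function name `conserved_windows` is hypothetical, the input is assumed to be a gap-padded multiple alignment of equal-length strings, only perfectly identical columns count as conserved, and the check against unrelated sequences (to avoid false positives) is not covered. The default minimum length of 20 follows the shortest primer length mentioned in the text; the toy sequences are invented.

```python
from typing import List, Tuple


def conserved_windows(aligned: List[str], min_len: int = 20) -> List[Tuple[int, int]]:
    """Return (start, end) column ranges, end exclusive, of runs at least
    min_len columns long in which every aligned sequence carries the same
    base and no sequence has a gap ('-')."""
    if not aligned:
        return []
    width = len(aligned[0])
    if any(len(seq) != width for seq in aligned):
        raise ValueError("all aligned sequences must have the same length")

    windows: List[Tuple[int, int]] = []
    run_start = None
    # Iterate one column past the end so a run touching the last column is closed.
    for col in range(width + 1):
        conserved = (
            col < width
            and aligned[0][col] != "-"
            and all(seq[col] == aligned[0][col] for seq in aligned)
        )
        if conserved:
            if run_start is None:
                run_start = col
        else:
            if run_start is not None and col - run_start >= min_len:
                windows.append((run_start, col))
            run_start = None
    return windows


# Toy alignment with invented sequences; a short min_len keeps the example readable.
toy = ["ATGGCCAAGTTTGA", "ATGGCTAAGTTTGC"]
print(conserved_windows(toy, min_len=6))
```

Candidate regions found this way would still need to be compared against sequences that do not code for the enzyme before being used as primers or probes.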

One preferred use in diagnostics is the detection of the presence of β,β-carotene
15,15'-dioxygenase in patients. There is variability in β-carotene cleavage potential among
the human population. Humans with low dioxygenase levels (e.g. with mutations or
polymorphisms in the gene for β,β-carotene 15,15'-dioxygenase) could be identified and
selected for vitamin A supplementation.

A diagnostic kit based on PCR can be designed to detect frequent mutations in the
dioxygenase gene. Another diagnostic option is quantification of mRNA by RT-PCR. With
this diagnostic tool differences in expression of β,β-carotene 15,15'-dioxygenase in various
tissues and in different species can be found.

Since the protein has been expressed and a method for purifying the protein is
described in detail in the examples, the person skilled in the art can use the protein or
peptides derived from the amino acid sequences in order to generate antibodies which
specifically react with the protein. It is possible to produce either polyclonal antibodies by
immunizing laboratory animals, like rabbits, sheep or goats, preferably with an adjuvant, or
monoclonal antibodies by the well-known technique described by Köhler and Milstein
(European Journal of Immunology, 1976, 6 (7), p. 511-519). The antibodies should
specifically react with β,β-carotene 15,15'-dioxygenase in order to avoid an unspecific
cross-reaction. This means that the antibodies of the present invention should preferably
react with an epitope which is present only on a protein of the present invention.

Such antibodies can be preferably used in immunoassays for the detection and/or
quantification of β,β-carotene 15,15'-dioxygenase in a test fluid. The test fluid may be a
liquid, like serum, obtained from a patient. There are several types of immunoassays which
are well-known to the person skilled in the art. Very frequently one antibody, preferably a
monoclonal antibody, is fixed to a solid phase. This antibody is then brought into contact
with the fluid containing the β,β-carotene 15,15'-dioxygenase and after washing it is
further reacted with a second monoclonal antibody which binds to another epitope of the
enzyme. The second antibody is usually labeled and shows the presence of the sandwich
consisting of the antigen and two different antibodies.

The antibodies can also be used in laboratory methods like Western blots or
immunoprecipitations. Preferably such antibodies can be used in immunohistochemistry
to detect epitopes of β,β-carotene 15,15'-dioxygenase in embedded or fixed tissues or cells
of any species of interest.

In a further embodiment of the present invention the β,β-carotene
15,15'-dioxygenase is used for the production of vitamin A, whereby the enzyme cleaves
β-carotene enzymatically into two molecules of retinal, which are subsequently reduced
by retinol dehydrogenase to vitamin A. The β,β-carotene 15,15'-dioxygenase can be used
to enzymatically convert β-carotene which may be obtained from plant sources. A
preferred source of β-carotene is the alga Dunaliella bardawil, which has a high
endogenous level of β-carotene. Suitable algae can be grown conveniently and β-carotene
can be purified therefrom at rather low cost. The carotene can be conveniently cleaved
enzymatically by using a protein of the present invention. The carotene dioxygenase can
preferably be immobilized in order to provide a continuous process.

Another aspect of the present invention concerns the introduction of the gene
coding for a protein having β,β-carotene 15,15'-dioxygenase activity into a suitable host
cell. The first step in such a method is usually to insert the cDNA into a suitable vector.
The vector must be compatible with the host cell into which the gene is to be introduced.
There are specific vectors available for bacteria, yeasts, plant cells, insect cells or mammalian
cells. Preferably the gene is combined with genetic structures which provide the required
genetic regulation, like promoters, enhancers, ribosomal binding sites etc.
Systems for the expression of genes encoding carotenoid biosynthetic enzymes in
prokaryotes, especially in E. coli, Bacillus subtilis or Flavobacter, and in eukaryotes, e.g.
fungi, are known in the art and described e.g. in EP Publication Nos. 747 483 and
872 554.

The vector having the gene and the other required genetic structures is then
introduced into suitable host cells by well-known methods like transformation,
transfection, electroporation or microprojectile bombardment. Depending on the host cell it
may be preferred to stably integrate the gene coding for a protein of the present invention
into the genome of the host cell. The cells obtained by such methods can then be further
propagated and if the cell is a plant cell it is possible to generate therefrom transgenic
plants.

In one embodiment of the present invention the host cells are plant cells and tomato
cells are especially preferred. The technology to produce transgenic tomatoes is
well-established and the tomato contains sufficient β-carotene to yield a
reasonable vitamin A level after introduction of carotene dioxygenase into the tomato
plant. In green pepper, melon or especially carrot the endogenous level of β-carotene is
even higher and therefore these plants are also especially preferred.

Another preferred embodiment of the present invention concerns algae.
Halotolerant algae may contain high levels of β-carotene. A transfection of such algae with
an expression vector comprising the β,β-carotene 15,15'-dioxygenase cDNA leads to a
high intracellular vitamin A level which can easily be recovered from such algae by simple
purification steps.

In another aspect of the present invention a gene coding for a protein of the present
invention can be introduced into mammalian cells and especially into human cells. It is for
example possible to insert the gene coding for a β,β-carotene 15,15'-dioxygenase into
suitable cells, for example peripheral blood stem cells. Such cells which contain the gene
for β,β-carotene 15,15'-dioxygenase may be administered to people having mutations or
deletions in the β,β-carotene 15,15'-dioxygenase gene. Such mutations and deletions,
respectively, may have the effect that such patients are not able to cleave β-carotene
enzymatically. Therefore, such patients always have a low vitamin A level and thus suffer
from various developmental and ophthalmological problems. The administration of
suitably transfected cells expressing the β,β-carotene 15,15'-dioxygenase to such patients
by way of somatic gene therapy is a way to improve their situation.

The following examples are provided to further illustrate methods of preparation of 
the enzyme of the present invention, as well as certain physical properties and uses thereof. 
These examples are illustrative only and are not intended to limit the scope of the invention 
in any way. 

EXAMPLES 

EXAMPLE 1 
Assay of β,β-carotene 15,15'-dioxygenase activity

For the tests the following solutions were prepared: 

a) Solution 1 (mixed micelle solution): Glycocholic acid (1.16 g) was dissolved in 5 ml 
H2O under stirring and by drop-wise addition of 5 N NaOH. After the pH was 
adjusted to 6.8-7.2 with acetic acid and the volume increased to 10 ml with H2O, 80 
mg of asolectin (Fluka) were added and dissolved under stirring. 

b) Solution 2 (substrate solution): 500 µl of an α-tocopherol solution (10 mg/ml in
hexane) and 235 µg of a β-carotene solution (80 µg/ml of pure all-E-β-carotene in
benzene) were mixed in a glass vial, protected from light and the solvents evaporated
under a gentle nitrogen stream. 1 ml of solution 1 was added under vortexing and,
if necessary, a few short ultrasonic bursts until a clear solution was obtained.
c) Solution 3 (homogenization buffer): 100 mM KH2PO4 adjusted with 5 N KOH to pH
7.8 and containing 4 mM MgCl2·6H2O and 30 mM nicotinamide.

d) Solution 4 (GSH solution): 60 mg/ml reduced glutathione dissolved in solution 3. 

e) Solution 5 (standard solution): 10 µg/ml vitamin A acetate in hexane/chloroform 9:1.

Activity Assay

2 ml of the enzyme preparation (approximately 4 mg protein, assayed by BCA
protein assay, Pierce Chemicals) were placed in a light-protected glass vial in a shaking
water bath (30 minutes, 37 °C). 0.2 ml of solution 4 was added and the reaction was started
after 2 minutes of temperature equilibration by addition of 50 µl of solution 2. After 3
hours, the reaction was stopped by placing the vials on ice and subsequently adding 1 ml
acetonitrile followed by 5 ml chloroform. The vials were vortexed 3 times for 7 seconds
and phase separation was obtained by centrifugation for 5 minutes at 5000 g. Extraction
was repeated twice with 0.6 ml chloroform. The combined chloroform phases were
evaporated and the residue was redissolved in 200 µl of solution 5 with brief sonication.
Insoluble material was removed by filtration through 0.45 µm filters. An aliquot of 20 µl
was separated by HPLC on a reversed-phase C18 column (Lichrospher 100, 5 µm,
12.5 cm x 4.6 mm, Bischoff Chromatography, Leonberg, Germany; 1 ml/min, column
temperature 25 °C) with a discontinuous, optimized gradient of
acetonitrile/tetrahydrofuran/(1% ammonium acetate in H2O) from 50:20:30 (eluent A) to
50:44:6 (eluent B). These conditions allow complete separation of β,β-carotene and
retinal as well as apo-β-carotenals and retinoic acids. Calibration curves were made for
both β,β-carotene and retinal in the concentration ranges 2-40 ng/µl and 1-10 ng/µl,
respectively, and were correlated to the value of vitamin A acetate which served as an
internal standard. Enzymatic activity was expressed as the amount of retinal liberated in
the activity assay during 3 hours of incubation at 37 °C (100% = 17.6 nmol).

EXAMPLE 2
Purification of β,β-carotene 15,15'-dioxygenase



Purification was done as rapidly as possible, and all buffers and equipment were 
cooled to 4 °C. 

Solution 6 (protease inhibitor-containing homogenization buffer): 125 mM
benzamidine-HCl, 250 mM 6-aminocaproic acid and 125 µM soybean trypsin inhibitor
were dissolved in H2O by sonication. A 4 ml aliquot of this solution was mixed with
100 ml of solution 3.
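
The final inhibitor concentrations in solution 6 follow from the dilution of the 4 ml stock aliquot into 100 ml of solution 3. The short calculation below is a sketch that assumes the volumes are additive (final volume 104 ml); the helper name `diluted` is illustrative and not part of the original protocol.

```python
def diluted(stock_conc: float, stock_vol_ml: float, diluent_vol_ml: float) -> float:
    """Final concentration after mixing, assuming the volumes are additive."""
    return stock_conc * stock_vol_ml / (stock_vol_ml + diluent_vol_ml)


# Stock concentrations as stated for the inhibitor solution (units in the labels).
stock = {
    "benzamidine-HCl (mM)": 125.0,
    "6-aminocaproic acid (mM)": 250.0,
    "soybean trypsin inhibitor (uM)": 125.0,
}

for name, conc in stock.items():
    # 4 ml of stock mixed with 100 ml of solution 3
    print(f"{name}: {diluted(conc, 4.0, 100.0):.2f}")
```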

Solution 7: 10 mM KH2PO4, 1 mM reduced glutathione, pH 7.8. 

Solution 8 (eluent A, phenyl-Sepharose chromatography): 10 mM KH2PO4, 1 mM
reduced glutathione, 0.5 M (NH4)2SO4, pH 7.8.

Solution 9 (eluent B, phenyl-Sepharose chromatography): 10 mM KH2PO4, 1 mM 
reduced glutathione, 10% glycerol, pH 7.8. 

Solution 10 (eluent B, Poros HQ chromatography): 10 mM KH2PO4, 1 mM 
reduced glutathione, 0.5 M NaCl, 10% glycerol, pH 7.8. 

Solution 11 (elution buffer for gel permeation chromatography): 50 mM KH2PO4,
1 mM reduced glutathione, 150 mM NaCl, 10% glycerol, pH 7.8. 

Laying hens at an age of 20-24 weeks (strain Lohmann LSL, Hatchery Wuethrich, 
CH-3123 Belp, Switzerland) were kept on a pigment-free chicken diet (Kliba 3179, Kliba, 
CH-4303 Kaiseraugst, Switzerland). The animals were killed by decapitation and the first 
20 cm of the duodenal loop was removed, separated from pancreas and rinsed with 40 mL 
each of 0.9% NaCl solution. The intestines (duodenal loops) were immediately frozen in
dry ice and stored at -80°C until use. 

Ten intestines (duodenal loops) were thawed on ice in approximately 2 hours and
opened length-wise in an ice-cooled Petri dish. The mucosa was scraped off with a slide,
weighed and homogenized in a Teflon-glass Potter-Elvehjem homogenizer in 4 volumes of
solution 6 with six strokes. Upon centrifugation at 62000 g for 1 hour, the clear supernatant
was divided into 32 aliquots of 15 ml each. From these preparations an ammonium
sulphate fractionation was made. The precipitate obtained from the 20-45% step was
centrifuged at 5000 g for 10 minutes and the pellet was stored at -80°C for further use.

Ten aliquots of the (NH4)2SO4 pellet were dissolved in 150 ml of solution 7,
sterile-filtered and loaded on a HiLoad 26/10 phenyl-Sepharose High Performance column
(column volume 53 ml; Pharmacia, Uppsala, Sweden) equilibrated with solution 8.
Proteins were eluted at a flow rate of 8 ml/minute with a steep gradient over 1 column
volume (CV) from solution 8 to solution 9. β,β-Carotene 15,15'-dioxygenase eluted at a
conductivity of < 15 mS/cm, but only fractions with a conductivity of < 1 mS/cm were
pooled and directly loaded onto a 30 ml Blue Sepharose 6 Fast Flow column (Pharmacia)
equilibrated with solution 9. β,β-Carotene 15,15'-dioxygenase activity eluted (at a flow rate
of 8 ml/min) in the break-through fractions, which were (again) directly loaded onto a 20
ml Poros HQ/M anion exchange chromatography column (PerSeptive Biosystems,
Framingham, MA, USA) equilibrated with solution 9. β,β-Carotene 15,15'-dioxygenase
was eluted at a flow rate of 15 ml/minute with a linear gradient over 18 CV from solution 9
to solution 10. Activity was detected in the gradient in a conductivity range of 10-20
mS/cm. The pooled fractions (70 ml) were concentrated to ~1.3 ml with Ultrafree-15 filter
units (MW cut-off 50,000; Millipore, Bedford, MA, USA). An aliquot of the concentrate
(500 µl) was loaded onto a Superdex 200 HR 10/30 gel filtration column (CV 24 ml;
Pharmacia) and eluted at a flow rate of 0.5 ml/minute with solution 11. Aliquots of each
fraction were used for activity assays (see Example 1) and, upon concentration, for
SDS-PAGE (with MOPS running buffer) on 10% NuPAGE gels (Novex, San Diego, CA, USA).
The results of this experiment are shown in Fig. 1 and Table 1.



TABLE 1

Purification step    Total protein   Total activity   Yield   Specific activity   Purification
                     (mg)            (nmol/h)         (%)     (pmol/(h·mg))       factor

(NH4)2SO4 pellet     779             8.61             100     11.0                -
phenyl-Sepharose     80.8            8.27             96.1    102                 9.27
Blue Sepharose       16.0            8.86             103     554                 50.1
Poros HQ             1.56            3.90             45.3    2500                226

Table 1: Summary table for the purification of β,β-carotene 15,15'-dioxygenase starting
with 10 aliquots of the (NH4)2SO4 pellet (means of 3-4 measurements).
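
The derived columns of Table 1 (yield, specific activity, purification factor) can be recomputed from the total protein and total activity columns. The sketch below is illustrative; the function name `summarize` and the dictionary keys are not from the source. Small differences from the tabulated values (for example in the purification factors) are to be expected, because the published figures are rounded means.

```python
# (step, total protein in mg, total activity in nmol/h) as listed in Table 1
steps = [
    ("(NH4)2SO4 pellet", 779.0, 8.61),
    ("phenyl-Sepharose", 80.8, 8.27),
    ("Blue Sepharose", 16.0, 8.86),
    ("Poros HQ", 1.56, 3.90),
]


def summarize(rows):
    """Recompute yield (%), specific activity (pmol/(h*mg)) and purification
    factor relative to the first row."""
    _, base_protein, base_activity = rows[0]
    base_specific = base_activity * 1000.0 / base_protein
    out = []
    for name, protein_mg, activity_nmol_h in rows:
        specific = activity_nmol_h * 1000.0 / protein_mg  # nmol -> pmol
        out.append({
            "step": name,
            "yield_pct": 100.0 * activity_nmol_h / base_activity,
            "specific_activity": specific,
            "purification_factor": specific / base_specific,
        })
    return out


results = summarize(steps)
for row in results:
    print(f"{row['step']:<18} yield {row['yield_pct']:6.1f} %  "
          f"specific activity {row['specific_activity']:7.1f}  "
          f"factor {row['purification_factor']:6.1f}")
```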

EXAMPLE 3

Amino acid sequence information for β,β-carotene 15,15'-dioxygenase

For amino acid sequence analysis, fractions of the gel filtration run (as shown in
Fig. 1) were separated on an 8-16% Tris/glycine gel (Novex), and the proteins transferred to
an Immobilon PSQ membrane (Millipore) and stained with amido black.

Since protein A proved to be N-terminally blocked, multiple aliquots of fraction 18
from the gel filtration run (see Fig. 1) were separated on a 10% Tris/glycine gel (Novex),
and the gel was stained with Colloidal Coomassie Blue (Novex). The band corresponding
to protein A was excised from the gel, and the protein digested in-gel with trypsin. The
tryptic digest was separated by micro-bore reversed-phase HPLC on a 150 x 1.0 mm
Vydac C18 column (Vydac, Hesperia, CA, USA). Peptides were eluted with an acetonitrile
gradient in 0.1% trifluoroacetic acid, and peptide-containing fractions were collected for
further analysis. Two fractions were identified by MALDI-TOF-MS (Voyager Elite,
PerSeptive Biosystems) to contain one single peptide each. N-terminal Edman degradation
revealed the following sequences:



(1) Ala-Glu-Val-Gln-Gly-Gln-Leu-Pro (Seq. ID No. 3) 
5 (2) Asn-Lys-Glu-Glu-His-Pro-Glu-Pro-Ile-Lys-Ala-Glu-Val-Gln-Gly-Gln-Leu-Pro 
(Seq. ID No. 6) 

Note that the last 8 amino acids of peptide (2) correspond to peptide (1). 
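
The overlap between the two peptides can be checked mechanically. The following is a minimal sketch that converts the three-letter sequences given above into one-letter code and tests whether peptide (1) forms the C-terminal end of peptide (2); the helper name and the reduced lookup table (only the residues occurring in these peptides) are assumptions made for illustration.

```python
# Three-letter to one-letter codes for the residues that occur in peptides (1) and (2).
THREE_TO_ONE = {
    "Ala": "A", "Asn": "N", "Gln": "Q", "Glu": "E", "Gly": "G", "His": "H",
    "Ile": "I", "Leu": "L", "Lys": "K", "Pro": "P", "Val": "V",
}


def to_one_letter(peptide):
    """Convert a hyphen-separated three-letter peptide into one-letter code."""
    return "".join(THREE_TO_ONE[residue] for residue in peptide.split("-"))


PEPTIDE_1 = "Ala-Glu-Val-Gln-Gly-Gln-Leu-Pro"
PEPTIDE_2 = ("Asn-Lys-Glu-Glu-His-Pro-Glu-Pro-Ile-Lys-"
             "Ala-Glu-Val-Gln-Gly-Gln-Leu-Pro")

p1 = to_one_letter(PEPTIDE_1)
p2 = to_one_letter(PEPTIDE_2)

# True when peptide (1) is identical to the last residues of peptide (2).
print(p1, p2, p2.endswith(p1))
```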

EXAMPLE 4

Cloning the full length cDNA for the β,β-carotene 15,15'-dioxygenase

A) RNA isolation:

A 4 week old Vedette chicken was killed, and the duodenum was removed, washed
with sterile PBS and cut open with scissors. The mucosal layer was scraped off with a
glass slide, weighed and homogenized immediately with a Polytron in 1 ml of TRIzol
reagent (Life Technologies) per 100 mg of tissue. Then the standard protocol from Life
Technologies was followed. Poly(A)+ mRNA was isolated with the PolyATtract mRNA
Isolation kit from Promega Corporation, Madison.



B) PCR and RT-PCR:

From the peptide sequence NKEEHPEPIKAEVQGQLP (peptide 2 of Example 3)
(Seq. ID No. 7) two degenerate primers were designed. To lower the degeneracy,
the base inosine (I) was used at one and at two wobble positions, respectively.

5' primer: 5' AAC AAR GAR GAS CAY CCI GA 3' (Seq. ID No. 8)
(20 mer with a degeneracy of 16x)

3' primer: 5' SAG CTG ICC CTG IAC YTC SGC 3' (Seq. ID No. 9)
(21 mer with a degeneracy of 8x)

R = A or G, S = C or G, Y = C or T
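
The stated primer lengths and degeneracies can be reproduced by multiplying the number of bases allowed at each position. The sketch below does this for the two primers; it assumes that inosine (I) counts as a single base (it is synthesized as one species) and that the primers anneal flush with the first codon of Asn and the last codon of Leu of peptide 2, which gives the expected product size. The names used are illustrative.

```python
# Number of alternative bases encoded by each symbol used in the two primers.
# Assumption: inosine (I) is one synthesized species and adds no degeneracy.
CHOICES = {"A": 1, "C": 1, "G": 1, "T": 1, "R": 2, "S": 2, "Y": 2, "I": 1}


def length_and_degeneracy(primer):
    """Return (number of bases, number of distinct sequences in the mixture)."""
    bases = primer.replace(" ", "").upper()
    degeneracy = 1
    for base in bases:
        degeneracy *= CHOICES[base]
    return len(bases), degeneracy


FORWARD = "AAC AAR GAR GAS CAY CCI GA"    # Seq. ID No. 8
REVERSE = "SAG CTG ICC CTG IAC YTC SGC"   # Seq. ID No. 9

# Residues of peptide 2 spanned by the primer pair (Asn ... Leu), 3 bases per codon.
SPANNED_RESIDUES = "NKEEHPEPIKAEVQGQL"
expected_product_bp = 3 * len(SPANNED_RESIDUES)

print(length_and_degeneracy(FORWARD))
print(length_and_degeneracy(REVERSE))
print(expected_product_bp)
```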

The oligos were synthesized on a Pharmacia Gene Assembler Plus using standard
phosphoramidite chemistry. Deprotection was done with 1 ml conc. ammonium
hydroxide solution (Applied Biosystems) and final desalting was performed with a NAP 10
column (Amersham Pharmacia Biotech).

For PCR, 100 ng of chicken duodenal cDNA were taken as template and the
following steps performed: 94°C 30"; 52°C 30"; 72°C 1' for 40 cycles. The resulting
band of 51 bp was cut out from a 10% polyacrylamide gel, electroeluted onto DEAE paper at
300 V for 1.5 hours, eluted from the DEAE paper once with 40 µl and twice with 30 µl of
1.5 M NaCl, 5 mM Tris, 0.5 mM EDTA, precipitated with 2.5 volumes of 100% ethanol and
1 µg glycogen, washed with 0.5 ml of 80% ethanol, dried and dissolved in 20 µl TE (10 mM
Tris, 1 mM EDTA).

The resulting fragment of 51 bp was cloned into pGEM-T Easy, a commercially
available T/A cloning vector (Promega Corporation, Madison). The corresponding cDNA
sequence was determined by automated fluorescent sequencing on a Vistra DNA
Sequencer 725 (Amersham Pharmacia Biotech).

From the above DNA sequence a homologous forward primer was derived:

5' TCT GAATTC CGGAGCCCATAAAAGC 3' (primer dioxyl2) (Seq. ID No. 10)

At the 5' end an EcoRI site (GAATTC; underlined sequence) was introduced; the following 17
nucleotides are homologous to the previously obtained dioxygenase sequence.

In an RT-PCR reaction a polyT/Not primer (commercially available from Invitrogen,
San Diego) was used as reverse primer together with primer dioxyl2.

A one-tube RT-PCR kit from Boehringer Mannheim was used and the corresponding
protocol followed:



mix 1:
    18.3 µl   H2O
     2.5 µl   DTT (100 mM)
     1.0 µl   dNTPs (10 mM)
     1.0 µl   oligo dT/Not (0.2 µg/µl) (3' primer)
     1.0 µl   dioxyl2 (5' primer) (20 µM)
     0.2 µl   RNAse inhibitor (40 U/µl)
     1.0 µl   chicken duodenal total RNA (2.2 µg/µl)
    -------
    25.0 µl

mix 2:
    14.0 µl   H2O
    10.0 µl   RT-PCR buffer 5x
     1.0 µl   enzyme mix (AMV RT, Taq and Pwo DNA Polymerase)
    -------
    25.0 µl

The two mixes were combined and the PCR protocol started on an MJ Research
PTC200 DNA Engine.
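
As a consistency check on the pipetting scheme, the component volumes of each mix can be summed and the stock concentrations converted into final concentrations in the combined reaction. This is only a sketch: the final volume of 50 µl is taken to be the sum of the two 25 µl mixes, and the data structure and function names are assumptions introduced for illustration.

```python
# Assumption: the two 25 µl mixes are combined, giving a 50 µl reaction.
FINAL_VOLUME_UL = 50.0

# (component, volume in µl, stock concentration or None, unit or None)
MIX_1 = [
    ("H2O", 18.3, None, None),
    ("DTT", 2.5, 100.0, "mM"),
    ("dNTPs", 1.0, 10.0, "mM"),
    ("oligo dT/Not (3' primer)", 1.0, 0.2, "ug/ul"),
    ("dioxyl2 (5' primer)", 1.0, 20.0, "uM"),
    ("RNAse inhibitor", 0.2, 40.0, "U/ul"),
    ("total RNA", 1.0, 2.2, "ug/ul"),
]
MIX_2 = [
    ("H2O", 14.0, None, None),
    ("RT-PCR buffer", 10.0, 5.0, "x"),
    ("enzyme mix", 1.0, None, None),
]


def mix_volume(mix):
    """Total volume of one mix in µl."""
    return sum(volume for _, volume, _, _ in mix)


def final_concentrations(mixes, final_volume):
    """Dilute every stock with a stated concentration into the final volume."""
    result = {}
    for mix in mixes:
        for name, volume, stock, unit in mix:
            if stock is not None:
                result[name] = (stock * volume / final_volume, unit)
    return result


final = final_concentrations([MIX_1, MIX_2], FINAL_VOLUME_UL)
print(round(mix_volume(MIX_1), 1), round(mix_volume(MIX_2), 1))
for name, (value, unit) in final.items():
    print(f"{name}: {value:.3g} {unit}")
```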



    50°C    30'
    94°C    2'

    94°C    30"
    57°C    30"                  10 cycles
    68°C    45"

    94°C    30"
    62°C    30"                  25 cycles
    68°C    45" + 3"/cycle

    68°C    7'
    4°C     overnight



With this RT-PCR protocol a band of 597 bp was amplified from chicken total 
duodenal RNA. The PCR band was isolated from a 1% agarose gel, cloned into pGEM-T 
Easy cloning vector and subsequently sequenced. The original peptide is present in the 
sequence as well as an open reading frame over the whole sequence of 597 bp. 

C) Chicken cDNA-library:

From chicken duodenal poly(A)+ RNA, cDNA was made with the Copy Kit
(Invitrogen, San Diego) using a modified Gubler-Hoffman procedure. The cDNA was
size-selected (0.9-5.5 kb) and cloned into the eukaryotic expression vector pcDNA1.1/Amp
(Invitrogen).

Electroporation into E. coli Top10 was done with a Bio-Rad Gene Pulser II system
following the standard protocol. This resulted in a cDNA library of 480,000 individual
clones. The library was split into 250 pools with 1500-2500 individual clones each. Each
pool was amplified in 100 ml LB medium. Bacterial growth was stopped at OD 0.8-1.0 by
adding chloramphenicol to a final concentration of 170 µg/ml. Incubation was continued
overnight in order to increase the DNA amount.
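
The pool sizes quoted above are consistent with the library size, as the following short calculation illustrates. It also derives which share of the library is covered by the 90 pools tested in part D; the variable names are illustrative and an even distribution of clones over the pools is assumed.

```python
TOTAL_CLONES = 480_000   # individual clones in the cDNA library
N_POOLS = 250            # pools the library was split into
POOLS_TESTED = 90        # pools tested in the activity screen (part D)

# Assumption: clones are spread evenly over the pools.
clones_per_pool = TOTAL_CLONES / N_POOLS
fraction_tested = POOLS_TESTED / N_POOLS
clones_tested = clones_per_pool * POOLS_TESTED

print(f"average pool size: {clones_per_pool:.0f} clones")
print(f"share of library tested: {100 * fraction_tested:.0f} % "
      f"(about {clones_tested:.0f} clones)")
```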

D) Activity screening of the chicken cDNA library:

90 of the above pools were tested for activity in a transactivation assay based on the
detection of retinoic acid, which is produced in eukaryotic cells after β-carotene cleavage.
The principle of the activity test is shown in Fig. 2.

5 µg of DNA from each pool were transfected with 20 µg of lipofectin (Life
Technologies) into a reporter cell line bearing a luciferase reporter plasmid with a RARE
(retinoic acid response element) in front of the tk promoter (Herpes simplex thymidine
kinase promoter). Transfections were done for 7 hours under serum-free conditions. After 7
hours the transfection mix was removed and RPMI medium with 10% charcoal-treated
FCS (fetal calf serum) was added. After 20 hours of incubation β-carotene (β-carotene
10% CWS, F. Hoffmann-La Roche Ltd.) or a placebo formulation was added to the
culture medium to a final β-carotene concentration of 5 µM. Incubation was continued for
18 hours. Then cells were washed with PBS, and luciferase expression was measured after
substrate addition with a nitrogen-cooled slow scan CCD camera (AstroCam Ltd.).
Exposure time usually was 8 min. Analysis was done with the Image Pro Plus 3.0 software
package (Media Cybernetics, Maryland). 3 pools were strongly positive; 7 pools showed
weaker, but detectable activity.

One of the positive pools was plated on a square agar plate. 2 filters (nylon
membranes, Gene Screen, NEN Research Products, Boston) were processed and screened
with the radioactively labeled ([α-32P]dATP, Amersham) 597 bp PCR fragment. From
9500 colonies screened, 14 were double positive. From 36 colonies picked, 5 showed the
same pattern after restriction site analysis. 2 clones were sequenced from the 5' end and the
original 51 bp sequence was found. Subsequently the whole cDNA was sequenced and
confirmed twice.

All molecular biological procedures were done according to Sambrook, Fritsch and
Maniatis, Laboratory Manual, Cold Spring Harbor Laboratory Press (1989), which is
incorporated by reference as if recited in full herein, if not otherwise mentioned.

The obtained cDNA sequence is shown in Figure 3 and the amino acid sequence
deduced therefrom in Figure 4.

Figure 4 shows the derived amino acid sequence having 526 residues.
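
How the amino acid sequence follows from the cDNA can be illustrated with the first 240 nucleotides of the sequence in Figure 3. The sketch below builds the standard genetic code, scans every ATG and keeps the longest translation; this selection rule and all names are assumptions made for illustration, not the procedure used to produce Figure 4. It also shows that 526 codons correspond to a coding fragment of 1578 bp without the stop codon.

```python
from itertools import product

# Standard genetic code in TCAG order ("*" marks a stop codon).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(codon): aa
               for codon, aa in zip(product(BASES, repeat=3), AMINO)}

# Nucleotides 1-240 of the cDNA shown in Figure 3.
CDNA_5PRIME = (
    "CGGATCCACTAGTAACGGCCGCCAGTGTGGTGGAATCCATCCTTCTATGT"
    "AACAGGAAAGAGCTGTTCTTAGCCCAGAGAGGAGGGCACCGTACGCCTGC"
    "AGGAGCAGCTGGGTAGAGGACACAGGAGAGCGATGGAGACAATATTTAAC"
    "AGAAACAAAGAAGAGCATCCAGAGCCCATAAAAGCTGAGGTGCAAGGTCA"
    "GTTGCCCACTTGGTTGCAAGGGGTACTTCTCCGAAATGGC"
)


def translate(seq, start):
    """Translate from `start` until a stop codon or the end of the fragment."""
    protein = []
    for i in range(start, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def longest_orf(seq):
    """Return (0-based start, protein) for the ATG giving the longest translation."""
    best = (-1, "")
    pos = seq.find("ATG")
    while pos != -1:
        protein = translate(seq, pos)
        if len(protein) > len(best[1]):
            best = (pos, protein)
        pos = seq.find("ATG", pos + 1)
    return best


start, protein = longest_orf(CDNA_5PRIME)
coding_length_bp = 3 * 526  # 526 residues, stop codon not counted

print(start + 1, protein)
print(coding_length_bp)
```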

EXAMPLE 5

Sequence comparison

Sequence comparison with the EMBL/GenBank databases revealed a high homology between
the known protein RPE65 (Hamel et al., J. Biol. Chem. (1993) p. 15751-15757) and the
β,β-carotene 15,15'-dioxygenase. A homology of 55.5% on the amino acid level
was found. The sequence alignment is shown in Figure 5.
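
For orientation, the share of identical positions in an alignment can be computed as sketched below, using the first block of the alignment in Figure 5 as input. This is a plain identity count over ungapped columns and is not the homology measure of the GCG package, so it is not expected to reproduce the 55.5% figure; the function name and gap handling are assumptions.

```python
GAP_CHARS = ".-"


def percent_identity(row_a, row_b):
    """Percentage of identical residues over columns without a gap in either row."""
    if len(row_a) != len(row_b):
        raise ValueError("aligned rows must have the same length")
    aligned = 0
    identical = 0
    for a, b in zip(row_a, row_b):
        if a in GAP_CHARS or b in GAP_CHARS:
            continue  # skip columns where one sequence has a gap
        aligned += 1
        if a == b:
            identical += 1
    return 100.0 * identical / aligned if aligned else 0.0


# First alignment block of Figure 5 (upper row: dioxygenase, lower row: RPE65).
DIOXYGENASE = "EEHPEPIKAEVQGQLPTWLQGVLLR..NGPGMHTIGDTKYNHWFDGLALL"
RPE65 = "EELSSPLTAHVTGRIPLWLTGSLLRCFTGPGLFEVGSEPFYHLFDGQALL"

print(f"{percent_identity(DIOXYGENASE, RPE65):.1f} % identity in this block")
```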



EXAMPLE 6

Expression of the cDNA for β,β-carotene 15,15'-dioxygenase in E. coli

With PCR the coding sequence of the β,β-carotene 15,15'-dioxygenase cDNA was
amplified and the resulting fragment of 1578 bp was cloned into the EcoRI/BamHI site of
the prokaryotic expression vector pQE-12 (Qiagen). The vector contains an in-frame
hexa-His affinity tag at the C-terminus of the dioxygenase. In addition, this plasmid
contains a regulated promoter with two lac repressor binding sites.

The E. coli strain M15pREP4 was transformed with the expression plasmid. For
expression 1 l LB medium, containing 100 µg/ml ampicillin and 25 µg/ml kanamycin, was
inoculated with 30 ml of an overnight culture. Growth was allowed until an OD600 of 0.6-0.8
was reached. At this point the culture was induced with 1 mM IPTG
(isopropyl-β-thiogalactoside) and growth continued for another 1.5-2 hours. Bacteria were
harvested by centrifugation and the pellet was frozen at -80°C.

The pellet was thawed by stirring in 20 ml extraction buffer (300 mM NaCl, 50
mM NaH2PO4, 20 mM Tris-HCl, 1 mg/ml Tween 40; pH 7.8) including 2.5 mg/ml
dodecyl-β-D-maltoside. 1 ml protease inhibitor cocktail (5.9 mM benzamidine-HCl, 10
mM 6-amino-caproic acid, 5 µM soybean trypsin inhibitor) was added at the same time.
The bacterial cells were lysed by a 4 minute treatment with a Polytron (Kinematica
AG, Switzerland) using a PT7 unit following a 4 minute sonication with a Branson
Sonifier 250.

The lysate was spun at 12000 x g and the supernatant purified over a Co2+-chelate
column (Talon Superflow Resin, Clontech, Heidelberg, Germany). The protein was eluted
with 15-20 ml of extraction buffer containing 150 mM imidazole.

The fractions were loaded on a 10% polyacrylamide gel and the protein-containing
fractions dialyzed against 150 mM tricine, 5 mM FeSO4, 3 mg/ml reduced glutathione,
0.21 mg/ml sodium cholate. The gel is shown in Fig. 6. The samples were then assayed in
the dioxygenase activity assay.

EXAMPLE 7

Expression of recombinant
β,β-carotene 15,15'-dioxygenase in the human duodenal cell line HuTu80

With PCR the coding sequence of the β,β-carotene 15,15'-dioxygenase cDNA was
amplified and the resulting fragment of 1578 bp was cloned into the BamHI/XhoI site of
the plasmid pSFV2gen. This vector is part of the Semliki Forest Virus expression system,
which works highly efficiently in most mammalian cells.

The plasmid was used for in vitro synthesis of recombinant RNA, which was
subsequently electroporated together with a helper virus into BHK cells (baby hamster
kidney cells) for production of a high titer virus stock. With an aliquot of this stock the
human duodenal cell line HuTu80 was infected. 16-18 hours after infection the cells were
harvested and the pellet frozen at -80°C. Either the whole cell pellet, the cytoplasmic
fraction or the membrane fraction was used in an activity assay. β,β-Carotene
15,15'-dioxygenase activity was found in the whole cell extract and in cytosolic fractions,
while in the membrane fraction no activity was detected.



EXAMPLE 8

β,β-Carotene 15,15'-dioxygenase activity of the
recombinant protein expressed in E. coli and in human cells

After expression in E. coli and purification over a metal chelate column, the protein
shows cleavage activity with β-carotene as substrate. Retinal was the only product detected
by HPLC after incubation with β-carotene. No apocarotenals or other metabolites were
found. This was shown by HPLC analysis as presented in Figs. 7 and 8.

The invention being thus described, it will be obvious that the same may be varied
in many ways. Such variations are not to be regarded as a departure from the spirit and
scope of the invention and all such modifications are intended to be included within the
scope of the following claims.
What Is Claimed Is:

1. An isolated polypeptide having β,β-carotene 15,15'-dioxygenase activity
comprising SEQ ID NO: 1 or a polypeptide having β,β-carotene 15,15'-dioxygenase
activity and being at least 60% homologous to SEQ ID NO: 1 as determined by the
Wisconsin Sequence Analysis Package GCG, Version 9.1 (1997).

2. An isolated polypeptide according to claim 1 wherein the polypeptide is at least
70% homologous to SEQ ID NO: 1.

3. An isolated polypeptide according to claim 1 wherein the polypeptide is at least
80% homologous to SEQ ID NO: 1.

4. An isolated polypeptide according to claim 1 wherein the polypeptide is at least
90% homologous to SEQ ID NO: 1.

5. An isolated polypeptide according to claim 1 which is derived from chicken.

6. An isolated nucleic acid sequence encoding the polypeptide of claim 1.

7. An isolated nucleic acid sequence according to claim 6 which comprises SEQ ID
NO: 2 or a fragment thereof.

8. An isolated nucleic acid sequence according to claim 7 wherein the fragment has at
least 20 bases.

9. An isolated nucleic acid sequence according to claim 7 wherein the fragment has at
least 30 bases.
10. An isolated nucleic acid sequence according to claim 6 wherein the nucleic acid is a
deoxyribonucleic acid.

11. An isolated nucleic acid sequence according to claim 6 wherein the nucleic acid is
an antisense ribonucleic acid.

12. A primer for amplifying a gene coding for a polypeptide having β,β-carotene
15,15'-dioxygenase activity which primer comprises a fragment of the nucleic acid
sequence according to claim 6.

13. A probe for detecting a gene coding for a polypeptide having β,β-carotene
15,15'-dioxygenase activity which probe comprises a fragment of the nucleic acid sequence
according to claim 6.

14. A test kit for amplifying and/or detecting a gene or a fragment thereof coding for
β,β-carotene 15,15'-dioxygenase wherein the test kit comprises at least one primer
according to claim 12.

15. A test kit for amplifying and/or detecting a gene or a fragment thereof coding for
β,β-carotene 15,15'-dioxygenase wherein the test kit comprises at least one probe
according to claim 13.

16. An antibody which specifically reacts with a polypeptide according to claim 1.

17. An immunoassay for the detection and/or quantification of β,β-carotene
15,15'-dioxygenase which comprises at least one antibody according to claim 16.

18. A process for the production of vitamin A comprising enzymatically cleaving
β-carotene with a polypeptide according to claim 1.

19. A method for introducing a β,β-carotene 15,15'-dioxygenase cDNA into a host cell
comprising introducing a cDNA coding for the polypeptide of claim 1 into a vector
suitable for the host cell and introducing the vector into the host cell.

20. A method according to claim 19 wherein the host cell is a plant cell. 

21. A method according to claim 19 wherein the host cell is a prokaryotic cell.

22. A method according to claim 19 wherein the host cell is a yeast cell or a fungal cell.

23. A method according to claim 19 wherein the host cell is an alga cell.

24. A method according to claim 19 wherein the host cell is a mammalian cell. 

25. A method according to claim 24 wherein the mammalian cell is a human cell. 

26. A host cell obtained by the method of claim 19. 

27. A host cell according to claim 26 which comprises a β,β-carotene
15,15'-dioxygenase cDNA obtained from another species.

27. An isolated polynucleotide which encodes β,β-carotene 15,15'-dioxygenase
comprising SEQ ID NO: 2.

28. An isolated polynucleotide according to claim 27 which consists essentially of SEQ 
ID NO: 2. 

29. An isolated polynucleotide according to claim 27 which consists of SEQ ID NO: 2. 

30. A vector comprising the polynucleotide of SEQ ID NO: 2. 

31. A host cell transformed with the vector of claim 30.

32. An isolated polypeptide having β,β-carotene 15,15'-dioxygenase activity
comprising SEQ ID Nos: 1 or 4.

33. A primer set for amplifying a polynucleotide encoding β,β-carotene
15,15'-dioxygenase comprising SEQ ID NO: 8 as a 5' primer and SEQ ID NO: 9 as a 3' primer.

34. A primer set for amplifying a polynucleotide encoding β,β-carotene
15,15'-dioxygenase comprising a polyT/Not reverse primer and SEQ ID NO: 10 as a forward
primer.

35. A kit for amplifying and/or detecting a polypeptide or fragment thereof encoding
β,β-carotene 15,15'-dioxygenase comprising at least one primer selected from the group
consisting of SEQ ID Nos: 8, 9, and 10.
Attorney Docket No. 20347/111656

ABSTRACT OF THE DISCLOSURE

Nucleic acid sequences coding for a protein having β,β-carotene
15,15'-dioxygenase activity and their uses in diagnostics, the synthesis of vitamin A and
methods for the introduction of the β,β-carotene 15,15'-dioxygenase cDNA into host cells
are disclosed.
Figure 3

Seq. ID No. 2

   1 CGGATCCACT AGTAACGGCC GCCAGTGTGG TGGAATCCAT CCTTCTATGT
  51 AACAGGAAAG AGCTGTTCTT AGCCCAGAGA GGAGGGCACC GTACGCCTGC
 101 AGGAGCAGCT GGGTAGAGGA CACAGGAGAG CGATGGAGAC AATATTTAAC
 151 AGAAACAAAG AAGAGCATCC AGAGCCCATA AAAGCTGAGG TGCAAGGTCA
 201 GTTGCCCACT TGGTTGCAAG GGGTACTTCT CCGAAATGGC CCAGGGATGC
 251 ACACAATAGG GGACACTAAA TACAACCACT GGTTTGATGG CTTGGCTCTG
 301 CTGCACAGCT TCACGTTTAA AAATGGTGAA GTTTACTACA GAAGTAAGTA
 351 CCTCCGAAGT GACACATACA ACTGCAATAT AGAAGCAAAC CGAATCGTGG
 401 TGTCTGAGTT TGGAACCATG GCTTATCCGG ATCCATGCAA AAACATATTT
 451 GCCAAGGCAT TCTCATACTT ATCTCACACC ATTCCTGAGT TCACGGACAA
 501 CTGCCTGATC AACATTATGA AAACTGGGGA TGATTATTAT GCTACCAGTG
 551 AGACTAACTT CATCAGAAAA ATTGATCCAC AGACTCTGGA GACACTAGAT
 601 AAGGTAGACT ACAGCAAATA TGTAGCTGTA AACTTGGCAA CTTCTCACCC
 651 ACACTATGAC AGTGCTGGAA ATATTCTCAA CATGGGTACT TCAATTGTTG
 701 ATAAAGGGAG AACAAAATAT GTTCTCTTTA AGATCCCTTC CTCTGTACCA
 751 GAAAAAGAAA AGAAGAAATC TTGTTTTAAA CACCTGGAAG TAGTATGCTC
 801 CATCCCTTCT CGCTCCCTGC TCCAACCAAG CTACTACCAC AGCTTTGGAA
 851 TCACAGAAAA TTATATTGTG TTCATAGAGC AGCCATTTAA ACTGGATATT
 901 GTCAAACTGG CAACTGCCTA CATCCGAGGT GTGAACTGGG CTTCCTGCCT
 951 TTCCTTTCAT AAGGAGGATA AGACGTGGTT TCACTTTGTA GACAGAAAGA
1001 CGAAAAAAGA AGTATCCACC AAGTTTTACA CTGATGCTTT GGTGCTTTAT
1051 CACCACATAA ATGCTTACGA AGAAGATGGC CACGTTGTTT TTGATATCGT
1101 TGCCTACAGA GACAATAGCT TGTACGATAT GTTTTACTTA AAAAAACTGG
1151 ACAAAGACTT TGAAGTGAAC AACAAGCTTA CCTCCATCCC AACCTGCAAG
1201 CGCTTTGTTG TGCCTCTGCA GTATGACAAG GATGCAGAAG TAGGTTCTAA
1251 TTTAGTCAAA CTTCCAACTT CCGCAACTGC TGTAAAAGAA AAAGATGGCA
1301 GCATCTATTG TCAACCTGAA ATATTATGTG AAGGGATAGA ACTGCCTCGT
1351 GTCAACTATG ACTACAATGG CAAAAAATAC AAGTATGTCT ATGCAACAGA
1401 AGTCCAGTGG AGCCCAGTTC CTACAAAGAT TGCAAAACTG AATGTCCAAA
1451 CAAAGGAAGT ACTGCACTGG GGAGAAGACC ACTGCTGGCC CTCAGAGCCC
1501 ATCTTTGTTC CCAGCCCCGA TGCAAGAGAA GAGGATGAAG GTGTTGTTTT
1551 GACCTGTGTT GTGGTGTCTG AGCCAAATAA AGCACCCTTC CTACTCATCT
1601 TGGATGCTAA AACATTCAAA GAATTGGGCC GAGCCACAGT TAACGTAGAA
1651 ATGCATCTGG ACCTGCATGG GATGTTTATA CCACAGAATG ATTTGGGGGC
1701 TGAGACGGAA TAAAACGCTA TTGATCCGAC TACACAAACT GAGACAACTT
1751 TCTACTGAAC ATGAGTTAAT ATCCCTTTTA CCATTCAAGA ACAACCATAT
1801 AACGACACAA AATGACTATG TATAATCTCT TAAATAATAG ATATAATCCT
1851 TTTAAGGCAC AGCGATGAGT TTTACTACAG GTAACGATAT GCACAACTGG
1901 CATATAACTA TTCCAAAAGA AGAAGAACGA TCAGTGTTTT AGAAGTGCTA
1951 ATGTTGTACA TAACGGCGGC AGAGGGAACA GGAGAGAAAG GTAACGGGAA
2001 TATTTAATAG AATATAGATT TCTGAGCAAA TGAAGTGCAG TATTTATGGT
2051 GTGATGCATG GCATGAGTCA CATAGGTCTG CAGCTCATGT ATCTTTTAGA
2101 GATCGTTTCA AGATTGCAGC TTGTGATGCA AGTTTTCTCC AGCCAGAAAA
2151 CCTCATTTTA AACCATCTGC TACTGGTAAT TCATACCAAT GCATTTTCTT
2201 GGTGCTCGAT TTACACTATA ACCAAAGTTA AGTATTACAT TCAGGTGCTA
2251 CAACTTTCTA ATTTACAACC GAAACAAACA AGCAAACAGC ACTTGCTTTG
2301 CTAATAACCC CATGGTGTAT TTTTCCTTTT TATGATGACA AAACCAAGTA
2351 CATATGGTTT TATGTAGCAT TCAATTATAC TTCAGTGCTA TTCCATCCTA
2401 ATGTTATAAG CAATTTGTAT TTAAATCAGT TTTCCTTGAG AATATCTGAC
2451 ATAACATTTT GTGTAATGAG ATGACTATGT TGTCTAAAGA TGAACAGGAA
2501 TGTATCTTTT ATTAGTATTG TTAATTGTGT TACTAATACT ATGCATATGA
2551 ATGAGAGCAA TGTATTTCTA GGAGAACTCA GATATACATT CAACAATTTC
2601 TGTAGGTGAA AATGCATTTA CTGATGAAAG TTGAATCGTT AATGAGGGAG
2651 AAAACTGGGT ATCCATCCAT CCAACTATGT TAGGTGTTCA CCTGGTCTGT
2701 ATGTGACACC ACGCTGTTTG GGTATCTCTC ACTTTCACAT ACCTGTTCTC
2751 ATGGTTTCTG CTACTCACTG TATTTTGCAG GAGAGAAACA AAATGAAATC
2801 ACTGTCACTT ACTATCGCCC CATCACATAA GAACAATGGG GCTTTGGTGA
2851 CTTGTTCATG ATTACATAAG ATGTTTGCAG CAGAGCAGCA ATAGAACCAA
2901 CACCATCCAC AGTTCTTGCT TGCTCTGTTA TGACTCCCTT TGCTGTCTTT
2951 ATGGTTTGCA TGTATGAAGA ATACACTGCC TAATTCTAAT GTTAAAAAGT
3001 CACTGGGGTC AGATCTAGAG CTTAAGTAAG CAGTCTGGGG TTTTCAAATG
3051 TTTATATGTT CCATAAAATG GAAATAAACA CCTCCATAAT AAAAAAAAAA
3101 AAAAAAAAAA A


Figure 4

Seq. ID No. 1

   1 METIFNRNKE EHPEPIKAEV QGQLPTWLQG VLLRNGPGMH TIGDTKYNHW
  51 FDGLALLHSF TFKNGEVYYR SKYLRSDTYN CNIEANRIVV SEFGTMAYPD
 101 PCKNIFAKAF SYLSHTIPEF TDNCLINIMK TGDDYYATSE TNFIRKIDPQ
 151 TLETLDKVDY SKYVAVNLAT SHPHYDSAGN ILNMGTSIVD KGRTKYVLFK
 201 IPSSVPEKEK KKSCFKHLEV VCSIPSRSLL QPSYYHSFGI TENYIVFIEQ
 251 PFKLDIVKLA TAYIRGVNWA SCLSFHKEDK TWFHFVDRKT KKEVSTKFYT
 301 DALVLYHHIN AYEEDGHVVF DIVAYRDNSL YDMFYLKKLD KDFEVNNKLT
 351 SIPTCKRFVV PLQYDKDAEV GSNLVKLPTS ATAVKEKDGS IYCQPEILCE
 401 GIELPRVNYD YNGKKYKYVY ATEVQWSPVP TKIAKLNVQT KEVLHWGEDH
 451 CWPSEPIFVP SPDAREEDEG VVLTCVVVSE PNKAPFLLIL DAKTFKELGR
 501 ATVNVEMHLD LHGMFIPQND LGAETE



Figure 5

Seq ID No. 4 and Seq ID No.

  10 EEHPEPIKAEVQGQLPTWLQGVLLR..NGPGMHTIGDTKYNHWFDGLALL
  20 EELSSPLTAHVTGRIPLWLTGSLLRCFTGPGLFEVGSEPFYHLFDGQALL

  58 HSFTFKNGEVYYRSKYLRSDTYNCNIEANRIVVSEFG..TMAYPDPCKNI
  70 HKFDFKEGHVTYHRRFIRTDAYVRAMTEKRIVITEFGFTTCAFPDPCKNI

 106 FAKAFSYLSHTIPEFTDNCLINIMKTGDDYYATSETNFIRKIDPQTLETL
 120 FSRFFSYFRGV..EVTDNALVNVYPVGEDYYACTETNFITKINPETLETI

 156 ..DKVDYSKYVAVNLATSHPHYDSAGNILNMGTSIVDKGRTKYVLFKIPS
 168 FTKQVDLCNYVSVNGATAHPHIENDGTVYNIGNCFGKNFSIAYNIVKIPP

 204 SVPEKEKKKSCFKHLEVVCSIPSRSLLQPSYYHSFGITENYIVFIEQPFK
 218 LQADKEDPISKFTS.EIWQFPCSDRFKPSYVHSFGLTPNYIVFVETPVK

 254 LDIVKLATAY.IRGVNWASCL.SFHKEDK.TWFHFVDRKTKKEVSTKFYT
 267 INLFKFLSSWSLWGANYMDCFESFTNETMGWLHIADKKRKKYLNNKYRT

 301 DALVLYHHINAYEEDGHVVFDIVAYRDNSL...YDMFYLKKLDKDFE...
 317 SPFNLFHHINTYEDNGFLIVDLCCWKGFEFVYNYFTLYLANLRENWEEVK

 345 VNNKLTSIPTCKRFVVPLQYDKDAEVGSNLVKLP.TSATAV..KEKDGSI
 367 KNARKAPQPEVRRYVLPLNIDK.ADTGKNLVTLPNTTATAILCSDEFTTI

 392 YCQPEILCEG....IELPRVNYD.YNGKKYKYVYATEVQWSPVPTKIAKL
 416 WLEPEVLFSGPRQAFEFPQINYQKYCGKPYTYAYGLGLNHF.VPDRLCKL

 437 NVQTKEVLH..WGEDHCWPSEPIFVPSPDAREEDEGVVLTCVVVSEPNKA 484
 465 NVKTKETWFTVWQEPDSYPSEPIFVSHPDALEEDDGWLSVWSPGAGQK 514

 485 P.FLLILDAKTFKELGRA..TVNVEMHLDLHGMF 515
 515 PAYLLILNAKDLSEVARAEFTVEINIPVTFHGLF 548



[Gel image; marker positions indicated at 66 kD and 45 kD]

Fig. 6 shows a 10% polyacrylamide gel with E. coli expressed β,β-carotene
15,15'-dioxygenase after affinity tag purification; lane 1 and lane 2: 2 fractions
from the Co2+-chelate column showing the main band at 60 kD; lane 3: low range
molecular weight marker (Bio-Rad).
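
The position of the main band can be compared with a rough mass estimate derived from the length of the polypeptide. The sketch below assumes an average residue mass of about 110 Da, a common rule of thumb, and ignores the hexa-His tag and any linker residues; the function name is illustrative and the result is only an approximation.

```python
# Assumption: average mass of one amino acid residue in a protein, in Da.
AVERAGE_RESIDUE_MASS_DA = 110.0


def estimated_mass_kda(n_residues, residue_mass_da=AVERAGE_RESIDUE_MASS_DA):
    """Approximate molecular mass in kDa from the number of residues."""
    return n_residues * residue_mass_da / 1000.0


# 526 residues as stated for the deduced amino acid sequence (Figure 4).
mass_kda = estimated_mass_kda(526)
print(f"estimated mass without tag: about {mass_kda:.0f} kDa")
```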



Fig. 7 shows an HPLC profile of the reaction mixture at the end of an
activity assay for the β,β-carotene 15,15'-dioxygenase following the
procedure in example 1. The first peak in the chromatogram represents
the internal standard, while the second peak corresponds to retinal
as the only product formed during the central cleavage with
β-carotene as substrate.

Fig. 8 confirms that the product peak in Fig. 7 is indeed
retinal. A sample which was positive in the activity assay
(green chromatogram) was spiked with retinal and analysed
in a second HPLC run (red chromatogram). The chromatograms
of the two runs were then overlaid.
SEQUENCE LISTING

<110> BACHMANN, Heinrich
BRUGGER, Roland
FRIEDLEIN, Arno M
WIRTZ, Gabriele M
WOGGON, Wolf-Dietrich
WYSS, Adrian
WYSS, Markus

<120> BETA,BETA-CAROTENE 15,15'-DIOXYGENASES, NUCLEIC ACID
SEQUENCES CODING THEREFOR AND THEIR USE

<130> B,B-CAROTENE 15,15'-DIOXYGENASES,...

<140>
<141>

<150> 103382.0
<151> 1999-02-22

<160> 10

<170> PatentIn Ver. 2.1

<210> 1 
<211> 526 
<212> PRT 
<213> CHICKEN 

<400> 1 

Met Glu Thr Ile Phe Asn Arg Asn Lys Glu Glu His Pro Glu Pro Ile
1               5                   10                  15

Lys Ala Glu Val Gln Gly Gln Leu Pro Thr Trp Leu Gln Gly Val Leu
            20                  25                  30

Leu Arg Asn Gly Pro Gly Met His Thr Ile Gly Asp Thr Lys Tyr Asn
        35                  40                  45

His Trp Phe Asp Gly Leu Ala Leu Leu His Ser Phe Thr Phe Lys Asn
    50                  55                  60

Gly Glu Val Tyr Tyr Arg Ser Lys Tyr Leu Arg Ser Asp Thr Tyr Asn
65                  70                  75                  80

Cys Asn Ile Glu Ala Asn Arg Ile Val Val Ser Glu Phe Gly Thr Met
                85                  90                  95

Ala Tyr Pro Asp Pro Cys Lys Asn Ile Phe Ala Lys Ala Phe Ser Tyr
            100                 105                 110

Leu Ser His Thr Ile Pro Glu Phe Thr Asp Asn Cys Leu Ile Asn Ile
        115                 120                 125

Met Lys Thr Gly Asp Asp Tyr Tyr Ala Thr Ser Glu Thr Asn Phe Ile
    130                 135                 140

Arg Lys Ile Asp Pro Gln Thr Leu Glu Thr Leu Asp Lys Val Asp Tyr
145                 150                 155                 160

Ser Lys Tyr Val Ala Val Asn Leu Ala Thr Ser His Pro His Tyr Asp
                165                 170                 175

Ser Ala Gly Asn Ile Leu Asn Met Gly Thr Ser Ile Val Asp Lys Gly
            180                 185                 190

Arg Thr Lys Tyr Val Leu Phe Lys Ile Pro Ser Ser Val Pro Glu Lys
        195                 200                 205

Glu Lys Lys Lys Ser Cys Phe Lys His Leu Glu Val Val Cys Ser Ile
    210                 215                 220

Pro Ser Arg Ser Leu Leu Gln Pro Ser Tyr Tyr His Ser Phe Gly Ile
225                 230                 235                 240

Thr Glu Asn Tyr Ile Val Phe Ile Glu Gln Pro Phe Lys Leu Asp Ile
                245                 250                 255

Val Lys Leu Ala Thr Ala Tyr Ile Arg Gly Val Asn Trp Ala Ser Cys
            260                 265                 270

Leu Ser Phe His Lys Glu Asp Lys Thr Trp Phe His Phe Val Asp Arg
        275                 280                 285

Lys Thr Lys Lys Glu Val Ser Thr Lys Phe Tyr Thr Asp Ala Leu Val
    290                 295                 300

Leu Tyr His His Ile Asn Ala Tyr Glu Glu Asp Gly His Val Val Phe
305                 310                 315                 320

Asp Ile Val Ala Tyr Arg Asp Asn Ser Leu Tyr Asp Met Phe Tyr Leu
                325                 330                 335

Lys Lys Leu Asp Lys Asp Phe Glu Val Asn Asn Lys Leu Thr Ser Ile
            340                 345                 350

Pro Thr Cys Lys Arg Phe Val Val Pro Leu Gln Tyr Asp Lys Asp Ala
        355                 360                 365

Glu Val Gly Ser Asn Leu Val Lys Leu Pro Thr Ser Ala Thr Ala Val
    370                 375                 380

Lys Glu Lys Asp Gly Ser Ile Tyr Cys Gln Pro Glu Ile Leu Cys Glu
385                 390                 395                 400

Gly Ile Glu Leu Pro Arg Val Asn Tyr Asp Tyr Asn Gly Lys Lys Tyr
                405                 410                 415

Lys Tyr Val Tyr Ala Thr Glu Val Gln Trp Ser Pro Val Pro Thr Lys
            420                 425                 430

Ile Ala Lys Leu Asn Val Gln Thr Lys Glu Val Leu His Trp Gly Glu
        435                 440                 445

Asp His Cys Trp Pro Ser Glu Pro Ile Phe Val Pro Ser Pro Asp Ala
    450                 455                 460

Arg Glu Glu Asp Glu Gly Val Val Leu Thr Cys Val Val Val Ser Glu
465                 470                 475                 480

Pro Asn Lys Ala Pro Phe Leu Leu Ile Leu Asp Ala Lys Thr Phe Lys
                485                 490                 495

Glu Leu Gly Arg Ala Thr Val Asn Val Glu Met His Leu Asp Leu His
            500                 505                 510

Gly Met Phe Ile Pro Gln Asn Asp Leu Gly Ala Glu Thr Glu
        515                 520                 525




<210> 2 
<211> 3111 
<212> DNA 
<213> CHICKEN 

<400> 2 

cggatccact agtaacggcc gccagtgtgg tggaatccat ccttctatgt aacaggaaag 60 
agctgttctt agcccagaga ggagggcacc gtacgcctgc aggagcagct gggtagagga 120
cacaggagag cgatggagac aatatttaac agaaacaaag aagagcatcc agagcccata 180 
aaagctgagg tgcaaggtca gttgcccact tggttgcaag gggtacttct ccgaaatggc 240 
ccagggatgc acacaatagg ggacactaaa tacaaccact ggtttgatgg cttggctctg 300 
ctgcacagct tcacgtttaa aaatggtgaa gtttactaca gaagtaagta cctccgaagt 360
gacacataca actgcaatat agaagcaaac cgaatcgtgg tgtctgagtt tggaaccatg 420
gcttatccgg atccatgcaa aaacatattt gccaaggcat tctcatactt atctcacacc 480 
attcctgagt tcacggacaa ctgcctgatc aacattatga aaactgggga tgattattat 540 
gctaccagtg agactaactt catcagaaaa attgatccac agactctgga gacactagat 600 
aaggtagact acagcaaata tgtagctgta aacttggcaa cttctcaccc acactatgac 660 
agtgctggaa atattctcaa catgggtact tcaattgttg ataaagggag aacaaaatat 720 
gttctcttta agatcccttc ctctgtacca gaaaaagaaa agaagaaatc ttgttttaaa 780 
cacctggaag tagtatgctc catcccttct cgctccctgc tccaaccaag ctactaccac 840 
agctttggaa tcacagaaaa ttatattgtg ttcatagagc agccatttaa actggatatt 900 
gtcaaactgg caactgccta catccgaggt gtgaactggg cttcctgcct ttcctttcat 960 
aaggaggata agacgtggtt tcactttgta gacagaaaga cgaaaaaaga agtatccacc 1020 
aagttttaca ctgatgcttt ggtgctttat caccacataa atgcttacga agaagatggc 1080 
cacgttgttt ttgatatcgt tgcctacaga gacaatagct tgtacgatat gttttactta 1140 
aaaaaactgg acaaagactt tgaagtgaac aacaagctta cctccatccc aacctgcaag 1200
cgctttgttg tgcctctgca gtatgacaag gatgcagaag taggttctaa tttagtcaaa 1260 
cttccaactt ccgcaactgc tgtaaaagaa aaagatggca gcatctattg tcaacctgaa 1320 
atattatgtg aagggataga actgcctcgt gtcaactatg actacaatgg caaaaaatac 1380 
aagtatgtct atgcaacaga agtccagtgg agcccagttc ctacaaagat tgcaaaactg 1440 
aatgtccaaa caaaggaagt actgcactgg ggagaagacc actgctggcc ctcagagccc 1500 
atctttgttc ccagccccga tgcaagagaa gaggatgaag gtgttgtttt gacctgtgtt 1560 
gtggtgtctg agccaaataa agcacccttc ctactcatct tggatgctaa aacattcaaa 1620 
gaattgggcc gagccacagt taacgtagaa atgcatctgg acctgcatgg gatgtttata 1680 
ccacagaatg atttgggggc tgagacggaa taaaacgcta ttgatccgac tacacaaact 1740 
gagacaactt tctactgaac atgagttaat atccctttta ccattcaaga acaaccatat 1800 
aacgacacaa aatgactatg tataatctct taaataatag atataatcct tttaaggcac 1860 
agcgatgagt tttactacag gtaacgatat gcacaactgg catataacta ttccaaaaga 1920 
agaagaacga tcagtgtttt agaagtgcta atgttgtaca taacggcggc agagggaaca 1980 
ggagagaaag gtaacgggaa tatttaatag aatatagatt tctgagcaaa tgaagtgcag 2040
tatttatggt gtgatgcatg gcatgagtca cataggtctg cagctcatgt atcttttaga 2100 
gatcgtttca agattgcagc ttgtgatgca agttttctcc agccagaaaa cctcatttta 2160 
aaccatctgc tactggtaat tcataccaat gcattttctt ggtgctcgat ttacactata 2220 
accaaagtta agtattacat tcaggtgcta caactttcta atttacaacc gaaacaaaca 2280 
agcaaacagc acttgctttg ctaataaccc catggtgtat ttttcctttt tatgatgaca 2340
aaaccaagta catatggttt tatgtagcat tcaattatac ttcagtgcta ttccatccta 2400 
atgttataag caatttgtat ttaaatcagt tttccttgag aatatctgac ataacatttt 2460 
gtgtaatgag atgactatgt tgtctaaaga tgaacaggaa tgtatctttt attagtattg 2520 
ttaattgtgt tactaatact atgcatatga atgagagcaa tgtatttcta ggagaactca 2580
gatatacatt caacaatttc tgtaggtgaa aatgcattta ctgatgaaag ttgaatcgtt 2640 
aatgagggag aaaactgggt atccatccat ccaactatgt taggtgttca cctggtctgt 2700 
atgtgacacc acgctgtttg ggtatctctc actttcacat acctgttctc atggtttctg 2760
ctactcactg tattttgcag gagagaaaca aaatgaaatc actgtcactt actatcgccc 2820
catcacataa gaacaatggg gctttggtga cttgttcatg attacataag atgtttgcag 2880
cagagcagca atagaaccaa caccatccac agttcttgct tgctctgtta tgactccctt 2940
tgctgtcttt atggtttgca tgtatgaaga atacactgcc taattctaat gttaaaaagt 3000
cactggggtc agatctagag cttaagtaag cagtctgggg ttttcaaatg tttatatgtt 3060
ccataaaatg gaaataaaca cctccataat aaaaaaaaaa aaaaaaaaaa a 3111 



<210> 3 



<211> 8 
<212> PRT 
<213> CHICKEN 

<400> 3 

Ala Glu Val Gln Gly Gln Leu Pro
1 5 



<210> 4 
<211> 506 
<212> PRT 
<213> CHICKEN 

<400> 4 

Glu Glu His Pro Glu Pro Ile Lys Ala Glu Val Gln Gly Gln Leu Pro
1 5 10 15

Thr Trp Leu Gln Gly Val Leu Leu Arg Asn Gly Pro Gly Met His Thr
20 25 30

Ile Gly Asp Thr Lys Tyr Asn His Trp Phe Asp Gly Leu Ala Leu Leu
35 40 45

His Ser Phe Thr Phe Lys Asn Gly Glu Val Tyr Tyr Arg Ser Lys Tyr
50 55 60

Leu Arg Ser Asp Thr Tyr Asn Cys Asn Ile Glu Ala Asn Arg Ile Val
65 70 75 80

Val Ser Glu Phe Gly Thr Met Ala Tyr Pro Asp Pro Cys Lys Asn Ile
85 90 95

Phe Ala Lys Ala Phe Ser Tyr Leu Ser His Thr Ile Pro Glu Phe Thr
100 105 110

Asp Asn Cys Leu Ile Asn Ile Met Lys Thr Gly Asp Asp Tyr Tyr Ala
115 120 125

Thr Ser Glu Thr Asn Phe Ile Arg Lys Ile Asp Pro Gln Thr Leu Glu
130 135 140

Thr Leu Asp Lys Val Asp Tyr Ser Lys Tyr Val Ala Val Asn Leu Ala
145 150 155 160

Thr Ser His Pro His Tyr Asp Ser Ala Gly Asn Ile Leu Asn Met Gly
165 170 175



Thr Ser Ile Val Asp Lys Gly Arg Thr Lys Tyr Val Leu Phe Lys Ile
180 185 190

Pro Ser Ser Val Pro Glu Lys Glu Lys Lys Lys Ser Cys Phe Lys His
195 200 205

Leu Glu Val Val Cys Ser Ile Pro Ser Arg Ser Leu Leu Gln Pro Ser
210 215 220

Tyr Tyr His Ser Phe Gly Ile Thr Glu Asn Tyr Ile Val Phe Ile Glu
225 230 235 240

Gln Pro Phe Lys Leu Asp Ile Val Lys Leu Ala Thr Ala Tyr Ile Arg
245 250 255

Gly Val Asn Trp Ala Ser Cys Leu Ser Phe His Lys Glu Asp Lys Thr
260 265 270

Trp Phe His Phe Val Asp Arg Lys Thr Lys Lys Glu Val Ser Thr Lys
275 280 285

Phe Tyr Thr Asp Ala Leu Val Leu Tyr His His Ile Asn Ala Tyr Glu
290 295 300

Glu Asp Gly His Val Val Phe Asp Ile Val Ala Tyr Arg Asp Asn Ser
305 310 315 320

Leu Tyr Asp Met Phe Tyr Leu Lys Lys Leu Asp Lys Asp Phe Glu Val
325 330 335

Asn Asn Lys Leu Thr Ser Ile Pro Thr Cys Lys Arg Phe Val Val Pro
340 345 350

Leu Gln Tyr Asp Lys Asp Ala Glu Val Gly Ser Asn Leu Val Lys Leu
355 360 365

Pro Thr Ser Ala Thr Ala Val Lys Glu Lys Asp Gly Ser Ile Tyr Cys
370 375 380



Gln Pro Glu Ile Leu Cys Glu Gly Ile Glu Leu Pro Arg Val Asn Tyr
385 390 395 400

Asp Tyr Asn Gly Lys Lys Tyr Lys Tyr Val Tyr Ala Thr Glu Val Gln
405 410 415

Trp Ser Pro Val Pro Thr Lys Ile Ala Lys Leu Asn Val Gln Thr Lys
420 425 430

Glu Val Leu His Trp Gly Glu Asp His Cys Trp Pro Ser Glu Pro Ile
435 440 445

Phe Val Pro Ser Pro Asp Ala Arg Glu Glu Asp Glu Gly Val Val Leu
450 455 460

Thr Cys Val Val Val Ser Glu Pro Asn Lys Ala Pro Phe Leu Leu Ile
465 470 475 480

Leu Asp Ala Lys Thr Phe Lys Glu Leu Gly Arg Ala Thr Val Asn Val
485 490 495

Glu Met His Leu Asp Leu His Gly Met Phe
500 505



<210> 5 
<211> 529 
<212> PRT 
<213> BOVINE 

<400> 5 

Glu Glu Leu Ser Ser Pro Leu Thr Ala His Val Thr Gly Arg Ile Pro
1 5 10 15

Leu Trp Leu Thr Gly Ser Leu Leu Arg Cys Phe Thr Gly Pro Gly Leu
20 25 30

Phe Glu Val Gly Ser Glu Pro Phe Tyr His Leu Phe Asp Gly Gln Ala
35 40 45

Leu Leu His Lys Phe Asp Phe Lys Glu Gly His Val Thr Tyr His Arg
50 55 60

Arg Phe Ile Arg Thr Asp Ala Tyr Val Arg Ala Met Thr Glu Lys Arg
65 70 75 80

Ile Val Ile Thr Glu Phe Gly Phe Thr Thr Cys Ala Phe Pro Asp Pro
85 90 95

Cys Lys Asn Ile Phe Ser Arg Phe Phe Ser Tyr Phe Arg Gly Val Glu
100 105 110

Val Thr Asp Asn Ala Leu Val Asn Val Tyr Pro Val Gly Glu Asp Tyr
115 120 125

Tyr Ala Cys Thr Glu Thr Asn Phe Ile Thr Lys Ile Asn Pro Glu Thr
130 135 140

Leu Glu Thr Ile Phe Thr Lys Gln Val Asp Leu Cys Asn Tyr Val Ser
145 150 155 160

Val Asn Gly Ala Thr Ala His Pro His Ile Glu Asn Asp Gly Thr Val
165 170 175

Tyr Asn Ile Gly Asn Cys Phe Gly Lys Asn Phe Ser Ile Ala Tyr Asn
180 185 190

Ile Val Lys Ile Pro Pro Leu Gln Ala Asp Lys Glu Asp Pro Ile Ser
195 200 205

Lys Phe Thr Ser Glu Ile Val Val Gln Phe Pro Cys Ser Asp Arg Phe
210 215 220

Lys Pro Ser Tyr Val His Ser Phe Gly Leu Thr Pro Asn Tyr Ile Val
225 230 235 240

Phe Val Glu Thr Pro Val Lys Ile Asn Leu Phe Lys Phe Leu Ser Ser
245 250 255

Trp Ser Leu Trp Gly Ala Asn Tyr Met Asp Cys Phe Glu Ser Phe Thr
260 265 270

Asn Glu Thr Met Gly Val Trp Leu His Ile Ala Asp Lys Lys Arg Lys
275 280 285

Lys Tyr Leu Asn Asn Lys Tyr Arg Thr Ser Pro Phe Asn Leu Phe His
290 295 300

His Ile Asn Thr Tyr Glu Asp Asn Gly Phe Leu Ile Val Asp Leu Cys
305 310 315 320

Cys Trp Lys Gly Phe Glu Phe Val Tyr Asn Tyr Phe Thr Leu Tyr Leu
325 330 335

Ala Asn Leu Arg Glu Asn Trp Glu Glu Val Lys Lys Asn Ala Arg Lys
340 345 350

Ala Pro Gln Pro Glu Val Arg Arg Tyr Val Leu Pro Leu Asn Ile Asp
355 360 365

Lys Ala Asp Thr Gly Lys Asn Leu Val Thr Leu Pro Asn Thr Thr Ala
370 375 380

Thr Ala Ile Leu Cys Ser Asp Glu Phe Thr Thr Ile Trp Leu Glu Pro
385 390 395 400



Glu Val Leu Phe Ser Gly Pro Arg Gln Ala Phe Glu Phe Pro Gln Ile
405 410 415

Asn Tyr Gln Lys Tyr Cys Gly Lys Pro Tyr Thr Tyr Ala Tyr Gly Leu
420 425 430

Gly Leu Asn His Phe Val Pro Asp Arg Leu Cys Lys Leu Asn Val Lys
435 440 445

Thr Lys Glu Thr Trp Phe Thr Val Trp Gln Glu Pro Asp Ser Tyr Pro
450 455 460

Ser Glu Pro Ile Phe Val Ser His Pro Asp Ala Leu Glu Glu Asp Asp
465 470 475 480

Gly Val Val Leu Ser Val Val Val Ser Pro Gly Ala Gly Gln Lys Pro
485 490 495

Ala Tyr Leu Leu Ile Leu Asn Ala Lys Asp Leu Ser Glu Val Ala Arg
500 505 510

Ala Glu Phe Thr Val Glu Ile Asn Ile Pro Val Thr Phe His Gly Leu
515 520 525

Phe



<210> 6 
<211> 18 
<212> PRT 
<213> CHICKEN 

<400> 6 

Asn Lys Glu Glu His Pro Glu Pro Ile Lys Ala Glu Val Gln Gly Gln
1 5 10 15

Leu Pro 



<210> 7 

<211> 18 

<212> PRT 

<213> CHICKEN 

<400> 7 



Asn Lys Glu Glu His Pro Glu Pro Ile Lys Ala Glu Val Gln Gly Gln
1 5 10 15



Leu Pro 



<210> 8 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer 
<220> 

<221> modified_base
<222> (18) 
<223> i 

<400> 8 

aacaargarg ascayccnga 20



<210> 9 
<211> 21 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer 
<220> 

<221> modified_base
<222> (7) 
<223> i 

<220> 

<221> modified_base
<222> (13) 
<223> i 

<400> 9 

sagctgnccc tgnacytcsg c 21 
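
The degenerate primers of SEQ ID NO: 8 and SEQ ID NO: 9 appear consistent with the chicken peptide of SEQ ID NO: 6: read in the sense direction, SEQ ID NO: 8 can encode Asn Lys Glu Glu His Pro, and the reverse complement of SEQ ID NO: 9 can encode Ala Glu Val Gln Gly Gln Leu. The following Python sketch checks this under stated assumptions: the standard genetic code is used, the inosine positions (annotated "i" and written "n") are treated as pairing with any base, and all function names are hypothetical helpers that are not part of the listing.

```python
# Minimal sketch: test whether a degenerate primer can encode a peptide.
# Assumptions: standard genetic code; 'n' (inosine in the listing) matches any base.

# IUPAC ambiguity codes mapped to the bases they stand for.
IUPAC = {
    "a": "a", "c": "c", "g": "g", "t": "t",
    "r": "ag", "y": "ct", "s": "cg", "w": "at", "k": "gt", "m": "ac",
    "b": "cgt", "d": "agt", "h": "act", "v": "acg", "n": "acgt",
}

# Complement of each IUPAC code (used for reverse-strand primers).
COMPLEMENT = {
    "a": "t", "c": "g", "g": "c", "t": "a",
    "r": "y", "y": "r", "s": "s", "w": "w", "k": "m", "m": "k",
    "b": "v", "v": "b", "d": "h", "h": "d", "n": "n",
}

# Standard genetic code, codons enumerated in t, c, a, g order.
BASES = "tcag"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def reverse_complement(seq):
    """Return the reverse complement of a (possibly degenerate) sequence."""
    return "".join(COMPLEMENT[base] for base in reversed(seq))


def codon_can_encode(codon, amino_acid):
    """True if at least one expansion of the degenerate codon encodes amino_acid."""
    return any(
        CODON_TABLE[a + b + c] == amino_acid
        for a in IUPAC[codon[0]]
        for b in IUPAC[codon[1]]
        for c in IUPAC[codon[2]]
    )


def primer_matches_peptide(primer, peptide):
    """Compare each complete codon of the primer with the peptide (one-letter codes).

    A trailing partial codon in the primer is ignored.
    """
    usable = len(primer) - len(primer) % 3
    codons = [primer[i:i + 3] for i in range(0, usable, 3)]
    return all(codon_can_encode(c, aa) for c, aa in zip(codons, peptide))


seq8 = "aacaargargascayccnga"    # SEQ ID NO: 8, spaces removed
seq9 = "sagctgnccctgnacytcsgc"   # SEQ ID NO: 9, spaces removed

print(primer_matches_peptide(seq8, "NKEEHP"))                        # sense primer
print(primer_matches_peptide(reverse_complement(seq9), "AEVQGQL"))   # antisense primer
```

The check only confirms that each degenerate codon has at least one expansion coding for the expected residue; it says nothing about annealing behaviour or about how the primers were actually designed.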



<210> 10 
<211> 25 
<212> DNA

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer 
<400> 10 

tctgaattcc ggagcccata aaagc 25 